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General non-asymptotic and asymptotic formulas 
in channel resolvability and identification capacity 
and their application to wire-tap channel 

Masahito Hayashi 


Abstract — Several non-asymptotic formulas are established in 
channel resolvability and identification capacity, and they are 
applied to wire-tap channel. By using these formulas, the e 
capacities of the above three problems are considered in the 
most general setting, where no structural assumptions such as 
the stationary memoryless property are made on a channel. As 
a result, we solve an open problem proposed in Han & Verdu[2] 
and Han [3]. Moreover, we obtain lower bounds of the exponents 
of error probability and the wire-tapper’s information in wire-tap 
channel. 

Index Terms —identification code, channel resolvability, infor¬ 
mation spectrum, wire-tap channel, non-asymptotic setting 


I. Introduction 

N 1989, Ahlswede & Dueck [1] proposed the identification 
code as a new framework for communication system using 
noisy channels. However, the upper bound of the rate of the 
reliable identification codes was not solved in their paper. In 
1993, for analysis of the converse part of this problem, Han & 
Verdu[2] proposed the channel resolvability problem, in which 
we approximate the output distribution to a desired output 
distribution by using a uniform input distribution with smaller 
support. In particular, the capacity of this problem is defined 
as the rate of the maximal number of the size of support 
for every desired output distribution. In order to discuss the 
channel resolvability problem, they introduced the concepts of 
‘general sequence of channels’ and the ‘information spectrum 
method’. They gave the relation between identification code 
and channel resolvability, and succeeded in proving the con¬ 
verse part of the capacity of identification code for the discrete 
memoryless channel. In this method it is essential that the 
performances of several problems be characterized by using 
the probability distribution of the random variable with a form 
of ‘likelihood’ function in this method. This insight is very 
useful for obtaining the overview of information theory [3]. In 
particular, it gives a useful insight into quantum information 
theory [12], [11], [10]. Therefore, Han & Verdu’s paper[2] is 
undoubtably the landmark of information spectrum. 

However, while Han & Verdu’s paper gives the capacity of 
channel resolvability for general sequence of channels [2], their 
proof of the converse part contains mistakes as is recognized 
in section 6.3. of Han[3]. They proved the achievability of 

M. Hayashi is with Quantum Computation and Information Project, ER¬ 
ATO, JST, 5-28-3, Hongo, Bunkyo-ku, Tokyo, 113-0033, Japan, (e-mail: 
masahito@qci.jst.go.jp) The material in this paper was presented in part at 
2004 International Symposium on Information Theory and its Applications, 
Parma, Italy, October 2004. 


channel resolvability with the asymptotic zero error setting for 
a general sequence of channels. Concerning the converse part, 
their proof is valid for the asymptotic e error setting when the 
general sequence of channels has a strong converse property. 
However, their proof is not valid in the general channel even 
in the asymptotic zero error setting. 

In this paper, we give several useful non-asymptotic for¬ 
mulas for identification code and channel resolvability, which 
are divided into two parts. One is the direct part of the 
identification code. The existence of a good identification code 
is proved in Theorem (TJ This construction is much improved 
from Ahlswede & Dueck’s construction. The other is the 
direct part of channel resolvability. The existence of a good 
approximation regarding the output statistics is proved in the 
two criteria, variational distance and K-L divergence as in 
Theorem |3 In this discussion, we derived upper bounds of 
the average of the variational distance and K-L divergence 
between the output distribution of a given distribution p and 
the output distribution of the input uniform distribution on 
M elements of the input signal space, when the M elements 
are randomly chosen with the distribution p (Lemma |2}. 
Combining Han & Verdu’s relation between identification 
code and channel resolvability, we derived the capacity of the 
channel resolvability for general sequence of channels with the 
asymptotic zero error setting, which was conjectured by Han 
& Verdu[2] (C3 and o of Theorem 0}. This discussion is 
valid even though the strong converse property does not hold. 

As another application, we give an upper bound of the 
capacity of the channel resolvability for a general sequence 
of channels with the asymptotic e error. As a byproduct, we 
show that there exists a sequence of codes whose second error 
probability goes to 0 in any general sequence of channels, and 
only the first error probability is asymptotically related to the 
probability distribution of the random variable with the form 
of ‘likelihood’ ( (1241 and (1251 of Theorem|4j- We also derived 
several lower bounds of exponent of channel resolvability in 
the stationary memoryless setting with respective error criteria 
(Theorem |6). 

Moreover, we apply our non-asymptotic formulas for chan¬ 
nel resolvability to wire-tap channel, in which there are 
two receivers i.e., the eavesdropper and the normal receiver. 
Wyner[4] introduced this wire-tap channel, and proved that 
its capacity is greater then the difference between the normal 
receiver’s information and the eavesdropper’s information. 
Csiszar & Narayan [16] showed that the capacity does not 
depend on the following two conditions for eavesdropper’s in- 
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formation: i) The eavesdropper’s information must be less than 
ne for given e > 0, where n is the number of transmissions, 
ii) The eavesdropper’s information must go to 0 exponentially. 
However, there are no results giving an explicit lower bound 
of the optimal exponents of wire-tapper’s information. 

Indeed, this problem is closely related to the channel resolv¬ 
ability as follows. In Wyner’s proof [4], in the asymptotic i.i.d. 
setting with a large enough number M, he essentially showed 
that when M elements of the input signal space are randomly 
chosen with a given distribution p, the output distribution of 
the distribution p can be approximated with a high probability 
by the output distribution of the input uniform distribution 
on the above M elements of the input signal space. This 
idea is also applied in Devetak[5] and Winter, Nascimento & 
Imai [6]. Using the same idea in the non-asymptotic setting, 
we can apply our formulas of channel resolvability to wire¬ 
tap channel, and derive a good non-asymptotic formula for 
wire-tap channel (Theorem^. As consequences we obtain the 
capacity of general sequence of wire-tap channel (Theorem|5|i, 
and lower bounds of the exponents of error probability and the 
wire-tapper’s information in the stationary memoryless setting 
(Theorem 0. We can expect that these results will be applied 
to evaluations of the security of channels. 

Finally, we should remark that our non-asymptotic resolv¬ 
ability formula regarding variational distance can be regarded 
as essentially the same results as Oohama[9]’s formula, where 
he treated the partial resolvability. Furthermore, he also de¬ 
rived a lower bound of exponent of channel resolvability by 
type method[8], 

II. Identification code in non-asymptotic setting 

Let W : x i—» W x be an arbitrary channel with the input 
alphabet X and the output alphabet y. The identification 
channel code for the channel W is defined in the following 
way. First, let Af = {1,..., N} be a set of messages to be 
transmitted, and denote by V{X) the set of all probability 
distribution over X. A transmitter prepares N probability 
distributions Q i,..., Qn £ V(X). If the transmitter wants to 
send a message i £ J\f, an encoder generates an input sequence 
Xi £ X randomly subject to the probability distribution Q,. 
In this case, the output signal y obeys the distribution Wq 4 , 
where the output distribution W p of a given input distribution 
p is defined as 


characterized by the following three quantities. One is the size 
N of the message sent and is denoted by |<&|, and the others 
are the maximum values of the two-type error probabilities 
given as: 

d = max W Qi {V?), A(<F) d = maxWg.(£>*), 

i %jtj 


where P{ is the complement set of D,. Concerning this 
problem, as discussed in the following theorem, the ‘likeli¬ 
hood’ function ^r(y) '= f suitablely characterizes the 

performance of good identification codes. 

Theorem 1: Assume that real numbers at,a' ,(3,(3' ,t,k > 0 
satisfy 


k log(-1) > log2 + 1, 1/3 >r>0, 1 > re > 0, 

1 1 1 def 1 1 

1 > —I —ji 7 — 1 757 >0- 

a a' P P 
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Then, for any integer M > 0, any real number C > 0, any 
channel W, and any probability distribution p £ V(X), there 
exists an identification code $ such that 
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— (y) < C 
W v ~ 


A($) < re + a 0 — 

o 


M 

7 


l$l = 


0 tM 


Me 


if 


/3E P . X W X < y 


^(y)<c\+a'(3 1 ^ 


W v 


C 


M 

7 


< 1, 
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where E p x denotes the expectation concerning random vari¬ 
able x obeying the probability distribution p. 

In the following, we omit x or p in the notation E ? , x , and 
abbreviate the set j y ^(y) < C j as j|p^(t/) < C^. we 
also denote the probability that the random variable X belongs 
to the set V, by Px(22) or Vx'D- If we do not need to take 
note of the random variable X, we simplify it to P (D) or I’D. 
This theorem is proven by using the following lemma. 

Lemma 1: (Ahlswede and Dueck[l]) Let A4 be an arbitrary 
finite set of the size M = \M\. Choose constants r and n 
satisfying the condition 0. Then there exist N{= LivTrJ) 
subsets Ai,..., An C A i satisfying 


x 

On the other hand, at the decoder side an TV-tuple of decoders 
is prepared. For every i = 1 ,,N, the z-th decoder judges 
that i £ J\f is transmitted if a channel output y belongs to Vi, 
where {Pi,..., T>n} are N subsets of y in advance. The i-th 
decoder judges that a message different from * £ Af if y ^ P^. 
Here, P, : is called the decoding region, of the message i. It is 
not required that Pi,..., Py be disjoint. In the identification 
coding problem, the z-th decoder is only interested in transmis¬ 
sion of the corresponding message i. Thus, we call the tuple 

rlpf 

of $ = (TV, {<3i,... Qn}, {Pi, ■ ■ ■, Pa}) an identification 
code of channel W. The performance of this code can be 


\Ai\ = [rM\, \Ai Cl Aj\ < ^ j). (4) 

Proof of Theorem [7] In this proof, the subset U x c = 
{^■(y) > c| plays an important role. First, we assume the 
existence of M distinct elements X\,... ,Xm of X satisfying 


W Xi {U c Xi ) < af3E p W x 


Wx 

w„ 


(y) < c 


w x 



< a'P' 


C 


~M~ 

7 


(5) 

( 6 ) 


From Lemma 0 we can choose N *= Lt/tJ su ^ sets 
Ai ,..., An of the set {x \,satisfying (@). Let Qi be 
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the uniform distribution on the subset A whose cardinality is 
|rMJ, that is, Qi is defined as 


Qi(x) d = t4-t X M*)’ 

I l ' x'eAi 


xGA 

(W, 

\w p 


x&Ai 


|A 


>1 - apE p W x \ ^(y) < C \, 


W Qi {Vj) = X 


xGAi 


= E A'W)+ E 

xeAitiAj 1 l| xeAifiAj 1 

<AA + \ • >,m», 


I A-1 1 ^ IA 

xGAiDA” 1 




W P (U X ) < ^W x {u x ) < A. 


1 

C' 


Using this inequality, we obtain 


Since (1 - ^) + (1 - E) 
ail,, Xm 1 such that 


> 1, there exist M elements 


(7) 


177 E 117 -. I U"»J I i“' 


M' 


M’ 


2 = 1 




M' - 1 

C 


where l x / = l.,./(.x') is an indicator function taking value 1 

def 

if £ = x' and 0 otherwise. Defining the subset T>i as V j = 
UxeA.A, we evaluate 

w Qi (2?o = x > X iir^(Wx) 


M' 


M' 


< aEpWcM). 


2=1 


In the following, the above M elements aq,..., Xm' are 
fixed, and we only focus on the random variable i subject to 
the uniform distribution on the set {1,..., M'}. Combining 
Markov inequality and the preceding inequalities, we have 


W x 





>1 -W 


P, {W Xi {U c Xi ) < paE p W x {U c x )} > 1 - 


A 


Hence, we obtain 


W x 


(U,,,A,) < P'a 


/„/ M'-l 


<n + a'l 3'X—]. 

C 7 

Therefore, we obtain the desired argument. 

Next, we prove the existence of M elements and M subsets 
satisfying 0 and 0 by a random coding method. Let M' be 
[—], and X = (Xi,... ,Xm') be M independent and iden¬ 
tical random variables subject to the probability distribution 
p £ V(X), then we have 


Jjfr Ulx i) - ^ ^ c 

W Xi (U c Xi ) < PaE P W X {U C X ) 

X 1 - 4 ) + U - A) - 1 =T. 


X 

which yields 

w xt (u 


P 

■Mx) < P'a 


• / _ / M' — 1 


j^i ^Xj J - H ^ C ’ 

W* 4 (Z/“J < /3aE p W x (W x c ) 


> LtM'j. 


Since l/yM'J = — EL, there exist M elements of 

X satisfying 0 and 0. Here, one may think that these M 
elements may not be distinct. However, if Xi = (i ^ i'), 
the relation W Xi (U x .) + W Xi — 1 holds. From 

condition 0 , this contradicts 0 and 0 . Hence, we obtain 
the desired bound. ■ 


M' / \ M' 

Ex ~M~> X U Ux 3 - Ex X X (Wxj) 

i=1 / f=i 


M'-l , . M'-l M'-l 

E —2 x 3 m' pi M M ,(J - 

i=i i=i 

Further, 


C 


1 (W° j 

Ex mJ X ^ = X E x<w Xi y -^r = E P W X (U C X ). 

2—1 2=1 

Using the Markov inequality Px{X > aEX} < E, /.g., 
Px{X < aEX} > 1 — E, we can show that 

f 1 M ' 1 1 

1A j W , X < oE p W x {W x ) >1--. 


III. Channel resolvability in non-asymptotic 

SETTING 

In the channel resolvability, we choose M elements 
Xi ,..., xm in the input set X for every probability distribution 
p £ V(X), such that the output distribution of the input 
distribution 



2=1 


close enough to the output distribution of p through the 
channel W. In particular, we call the distribution with the 
preceding form an M-type. In this setting, our purpose is to 
disenable the receiver of the given channel W to distinguish 
whether the sender generates the input signal based on ‘the 
given distribution p ’ or ‘the M-type 17 A* with a 

smaller number M’. This kind of indistinguishability can 
not be applied to any realistic model, directly, but it can be 
technically related to wire-tap channel. In particular, we prove 
Lemma 0] in this section as the technically essential part, but 
this lemma is also the technically essential part for the direct 
part of wire-tap channel. 















4 


In the following, we call the pair of the integer M and the 
M elements X \,..., xm of X, a resolvability code ’I' with the 
size T '=' M. The performance of a resolvability code T is 
characterized by its size |\U| and the variational distance 

/ M 

e(9,W p ) d ^ d[Y,—W Xl ,W p 

where the variational distance d(p, q ) defined by 

d(p,q) = ^\p{y) -q{y)\, 

V 

which equals the l\ norm \\p — qj|i. Another characterization 
of its performance is given by K-L divergence 

M 

D(^,W p ) d ^ D(J2jjW Xi \\W p ), ( 8 ) 



Lemma 2: The M random variables X = (X 1 ,..., X M ) 
satisfy the following inequality 


ExIKx - W p \\ t <25 P wc 



e x d(w¥\\w p ) < 


log(l + 

— t 


( 12 ) 

(13) 


for 0 > t > — If the cardinality y is finite, the inequality 

E x D(W¥\\W p ) <v(S PiWi c) + <W,c log \y\ + 

(14) 


holds. 

Since there exists a resolvability code T with the size M 
such that 


where D(p\\q) d = E y P(y) log 

Theorem 2: For any integer M > 0, any real number C > 
0 and any probability distribution p £ V{X), there exists a 
resolvability code T such that |d/| = M and 

e(M>, W p ) < 2 S PiW ,c + (9) 

3p,w,c ^ 

where S p , WtC d = E p W a; |^(y) > cj, and S pWC = f 

| w^i'y) < C). If the cardinality |F| is finite, for any 
0 > f > —1/2, there exists a resolvability code T 1 ' such that 
= M and either of 


D(V, W p ) 

^log(l + M t e^ w ’ p '>) 

~ -t 

(10) 

D(*', W p ) 

6' 

<r]($ P ,w,c) + fi P ,w,c log F| + 

(11) 

holds, where 

rj(x) = f —a; log a: and <j>(t\W,p) 

def 

logE y (E P IFi /(1+t) (y)) 1 +*. 



Remark 1: The partial resolvability version of inequality 
@ has been obtained by Oohama[9], Inequality 0 can be 
regarded as the essentially same result as Oohama’s inequality. 


Proof: In the following, the indicator functions and /): 
on the sets U x = | ^f{y) > C j and their compliment sets U x 
play important roles. In our proof of Theorem [2] we use the 
random coding method, i.e., we consider the M independent 
and identical random variables X = (Xi ,..., Xm) subject to 
p. Using the notations: 

W?(y) = W x (y)I c x (y), Wg{y) = W x (y)I x (y), 

Wp(y) = E p W?(y), Wg{y) d ^ E p Wg(y), 

M M 

w $(v) = JfE W xM w*{y) = ^J2 w i(y), 

1=1 i =1 

1 M 

Wg(y) = W%(y) + W$(y) = jjJ2 


e(9,W p )<E x \\w¥ 

the inequality O guarantees the existence of a resolvability 
code T satisfying 0- On the other hand, the relation = 

I c Av) < C holds. Thus, 


dp,W,C 




Similarly, since there exists a resolvability code T with the 
size M such that 


D(V,W P )<-E X D(W™\\W P ), 

the inequalities O and (HI guarantees the existence of a 
resolvability code T satisfying GoJ and {□}• ■ 

Proof of Lemma 
First, we show m. Since 

S P ,w,c = E P>X W X {U X ) = E p \\Wg\\ = \\Wg\\, 

we can evaluate 


Ex 

=Ex 


wg-w p II, 


wz - w° + wi- wg 
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<E X \\W% -W^ 

=Ex||lU?-W p “|| 1 


V —Ex 
^ M 


IF 


Xt 


2E p , x W x (U x ). 


IlF^II 

I yr p 111 


Next, we focus on the Schwarz inequality regarding the 
random variable lx(y) '=' i^r{y) an< l the sign function 
lx(y) c = f (we can check that l 2 x = 1.), then we obtain 

(||WpZx||i) 2 = (E Wp \l x {y)\) 2 = (E Wp lx{y)lx{y)) 2 
<E w J 2 x(y) E wJ 2 x(y) = e wjx(y)- 


Thus, the Jensen inequality yields that 

(Ex \\w% - if;IIJ 2 < Ex IIif; - if; 


we have the following lemma. 


2 < ExE Wplxiy)- 
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Since Eg. !!/ fr? = ! y < , we have 

x w p (v) w p (v) ’ 

ExE w p lx = ^w p ^xl 2 \(y) 


1 M 

=e ^ E '^E 

i= 1 


=Ew — E r 
Kp M 


Therefore, we obtain 
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w Xj (y) Wp(y) 
W P {y) W p (y ) 


(W2{y)\ 2 

(W«(y)\ 2 \ 

V W p (y) J 

\W P (y)J ) 


w°(y) 

W p (y) 


2 Si 


E 


x 


E Ti w *- - w ’ 


p.W,C 
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< 25p,w,c + 


'S. p ,w,c 

M ‘ 


<Ex E (Wxiy) - l) + W&y) log 

-E=*"sw(^ 


y ' x ' w/ ' y 

Regarding the first term, we can calculate 

^ExH^(y) ( M/ “ (y) 
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Wg(y) 

-i) + E^)‘» 6 J 
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=E e ^4E^.(!') 
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=E 4 e ^ w ^) rs®-i 
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<e 4 e -S^^)= w 


y 


M p ’ x fE p (y) 


where we use the relation ExVU? 

i ± j. Cc 
we have 


M 

'WxXy) 


J2 w p(y) 

y Wp (y) 

x- W£(y) Wj?(j/) 

= - AEog/v - log 

y 

<y(Y w p(y)) + E w £(v) lo s M. 


y 
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Finally, we prove o by a different method. The quantity 
Ex£ , (W / Y / ||fE p ) can be regarded as the mutual information of 
channel X <—> W x with the input probability p M (X ) which 
equals the M -fold i.i.d. of p. We can check that the function 
t h-> <p{t\W M ,p M ) satisfies the following property: 

0(O| W M ,p M ) = 0 
d(j){t\W M ,p M ) 


dt t =o 
d 2 cf>(t\W M ,p M ^ 


= -V x D(Wg\\W P ), 


dt 2 


> 0 . 


i=l 

Hence, we obtain (II 2\ . 

Next, we show dll. Since by using the 

v\V) Wp (y) 

inequality log x <x—l, we can evaluate 
E x D(W¥\\W p ) 


Hence, its convexity guarantees the inequality 
—t FixD(Wx\\W p ) < <j>(t\W M ,p M ), which implies 
the inequality 

(f)(t\W M ,p M ) 

VpJ ~ 


E x D(Wg\\W p ) < 


—t 


(15) 


for 0 > t > — j- 


Let 1 + s = then 1 > s > 0 and t = Since 


l+t 

x i—» X s is concave. 


Ex(E w Xj ( y)Y < [Ex E w * (y)] s = ( M - 

(16) 

Using d and the relation ( x+y) s < X s + y s for two positive 
real numbers x, y, we obtain 

e mw M ,p M ) = J 2 (E x (W™) 1+ \y))^ 
y 

( M \ TT7 

Ex E w *< (v) { w *< (y) + E (y )) 5 

*=1 ) 

( M \ TTJ 

E Ex E W Xi ( y ) (w° Xi (y) + (E W Xj (y)) *) 
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y \ i =l 
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=sE 


y \i=i 
M 


Y E *WxM (E 




- 1 = 0 for 


^KKjf 4 (y) (^w^T 
i ^ j. Concerning the second term, letting K f = f W£(y), 


< E ^ (E E ^^! s (y)+E( M ^ i) s ^ P 1 +s (y)) 

y \i=l i=l / 

=j] E (^EgfUi+ s (y) + M(M - iy\VY s (y)) ^ 

y 

<T E (ME x Wi+ s (y))^ + (Af(M - l) s TU p 1+s (t/)) ^ 


M 


M 


=E 






0 (t|tV,p) 


because log |3^| is the maximal entropy of the distribution 
on the probability space y. Since J2 y W^(y) = <5 p ,ty,C> we 
obtain HI- 


^ + 77^ E (ExWg 1+s (y))= 1 + MV 

y 

Since —t is positive, the desired inequality d 1 31 follows from 
E3 and the above inequality. ■ 
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Next, we proceed to the relation with identification codes. In 
order to discuss this relation, we focus on channel resolvability 
of the worst input case, and define the following values: 

Hpf 

e(M,W) = max min e(\E ’,W P ), 

D(M,W) d = max min D('&,W„), 

pe'P(X)9:\'5f\<M 

which satisfies 

e(M, W) < 2maxE p H4 ; ||f(y) > cj + (17) 

for any real number C > 0. 

Lemma 3: (Han & Verdu[2]) If the cardinality \X\ is finite, 
and if an identification code $ and an integer M satisfy 

l-/r($)-A($) >e{M,W), 

then 

\X\ M > |$|. (18) 

Proof: Let the identification code $ be a triplet 

(N, {Qi,... ,Qn}, {T>i,... ,T>n}), then there exist N M- 
types Q[,... Q' n such that 

d(W Qi ,W Q ') <e(M,W). 

Since the inequalities 


2 e{M, W) + d(W Q ,,W Q ,) 

>d(W Qi ,W Q .) + d(W Qj , W Q ,) + d(W Q , , W Q ,) 
>d(W Qi ,W Qj ) > 2 (W Qi (Vi) - W Qj {Vi)) 

>2(1 -/*($) - A($)) 

hold for any i Y j, we can show 

d(W Q >,W Q >) >0, 

which implies that Q' i is different from Q'. However, the total 
number of M-types is less than X | A1 . Therefore, we obtain 

m. ■ 

IV. Wire-tap channel in non-asymptotic setting 

Next, we discuss the message transmission with the wire- 
tapper who has less information than the main receiver. This 
problem is formulated as follows. Let y be the probability 
space of the main receiver, and Z be the space of the wire- 
tapper, then the main channel from the transmitter to the main 
receiver is described by W B : x i—> Wf , and the wire-tapper 
channel from the transmitter to the the wire-tapper is described 
by W E : x i—> W E . In this setting, the transmitter choose 
M distributions Qi,... ,Q y; on X, and he generates x € 
X subject to Qi when he wants to send the message i G 
{1,..., M}. The normal receiver prepares M disjoint subsets 
T>i ,..., T>m of y and judges that a message is i if y belongs to 
T>i. Therefore, the triplet (M, {Qi ,..., Qm}, {2?i, ..., V M }) 
is called a code, and is described by <t>. Its performance is 
given by the following quantities. One is the size M, which is 


denoted by |T>|. The second one is the average error probability 

£b($): 


M 

i=1 

and the third one is the wire-tapper’s information regarding 
the transmitted message Je(<!>): 

i E ^) d ^J2^D(wg\\wi), < = 

i i 


A different measure of the wire-tapper’s information is given 
by the average variational distance d# (<!>): 


d E { ( $) 


def 


l 

M(M - 1) 




Theorem 3: There exists a code $ for any integers L, M, 
any real numbers C, C' > 0, and any probability distribution 
p on X such that 


|$| = M 

e B ($) < (19) 


es($)<3(E p W x 


'W. 


y 

B 
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(y)<c\ + ff 


Ie(®) < 3 I r/(S Pt w E ,c) + $p,w E ,C l°g \2\ + 


u p,W E ,C 


Ie(^) < 3 min 

0>t> —1/2 


log(l + L t e^ wE ^) 


—t 


d E {$) 4 6 26p \yE,c + 


' S P,W E ,C 


( 20 ) 

) 

( 21 ) 

( 22 ) 

(23) 


Proof: We prove Theorem[3]by a random coding method. 
Let X = (X^m) be LM independent and identical random 
variables subject to the distribution p on X for integers l = 
1,L and m = 1,..., M, and V[ m (X) be the maximum 
likelihood decoder of the code Xi >rn , then we can evaluate as 
follows by Gallager upper bound[7]. 


l,m 

< Q nun(ML) s £ (e p (Wf (j/)) 1/(1+a) ) ^ . 
y 

Since the maximum likelihood decoder is better 
than the code VfJX) = (^r(y) > C"} \ 

\wfr{y) >C'}, we have another evaluation 
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^J^J2 W X m AKm(X) C ) 


l,m 


^xm'E^-'Wrnixr) 


<K VTrrX" 


ML 

E x 


W- 


x„ 


l,m 

1 

~ML 


W B 


<y) < C 


Y, W L, E 

l,m (l' 


w B 


w p B 


T~(y) < C 


<E P ,,Wf \ ^l(y) < C 


W p B 


+ W b {ML - 1)E P 


wl 

W B 


(y) < c 


<E p, x W B 


Wl 

W B 


fa) <C) + ^. 


Let Q m {X) be the uniform distribution on 
{X 1>m ,...,X L>m }, V m (X) be U iD’ lm {X), and $(X) 
be the code (M,{Q m (X)},{D m (X)}), then E*e B ($(X)) 
is less than the right hand sides of (1191 and Co) because the 
average error probability of <h ( X ) is less than the one of the 
code (ML,{X mtl },{V ltm {X)}). 

Since 


M 


E Tf D ^ m(X )\\Wi (x) ) + D(Wi {x) \\W B ) 


m =1 

M 


M 


= E \\Ki 


m =1 

we obtain 


Using Markov inequality, we obtain 

P A -{e s ($(X)) < 3Ee s ($(X))} c < ± 
PxfWPQ) < 3 E/ b ($(X))} c < i 
P x {d E mX)) < 3Ed £ ($(X))} c < 1 
Therefore, there exists a code $ satisfying desired conditions. 


V. General asymptotic setting 

A. Identification code and channel resolvability 

Next, we focus on an arbitrary sequence of channels W = 
in which W n is an arbitrary channel from X n to 
y n . In this setting, two-types of (//.. A)-identification capacities 
are defined by 


D(fi, A|W) 


def 


l 


= sup < lim — log log | 

{•*>„} l n 

A|W) 


def 


l 


= sup •{ lim — log log | $ 

{4>n} 


lim n($n) < /r,limA($„) < A 


lim/r(T>„) < /r, lim A($„) < A \ . 


However, in the case of y = 0, we replace lim^t($ rl ) < 
/x, (li< n) by lim/z($„) = 0, (lim= 0) at 
the above two definitions. On the other hand, two-types e- 
resolvability capacities are defined by 

S(e|W) d = sup{R\\ERe(e nR ,W n ) < e} 

5' t (e|W) d = sup{f?|lime(e"' R ,kP ri ) < e} , 


1 

E x I E mX)) = E x JJ D ( W Q m (x)\\Wi {X )) 

m= 1 

AI 

<vx E m d wLw\\K) 

m =1 

< r i{d pW E ,c) + $p,w E ,c 1°S 1-21 + P ’W C , 

where the last inequality follows from Lemma 0 Similarly, 
we can show 


E A / B ($pO) < 


log(l + ZAe^l^’Pl) 
— t 


Regarding dE(&(X )) 

E A 




) 


M(M-l) ^ 

T, d ( W Qdx),W p E )+d(W B AX 

iAj 


- Ex M(M -1) ^ 
=2E xd(W B l{x) ,W p E 

<2 [ 2 S Pt w,c + 


I l,w,c 


where the case of e = 2, we replace < e by < 2 at the above 
two definitions. 

In the information spectrum method, the following quanti¬ 
ties are defined for arbitrary sequence p = {p n }%Li of input 
probability distributions: 


I(e |p,W) 


def. r i 

= mi < a 


lim E p n Wf. 


I(c|p,W) 


def . r 

= mi 


1 w n 

-‘og^falx.^. 


WI‘ 


limEpnlU” < — log jUiy) >a < 


n 


ws 


where the case of e = 1, we replace < by < at the above 
definitions. These quantities have another expression as 


I(e |p,W) 
= sup < a 


lim E p n W£ 


1 i W x . 

-log—( 2 /)<a^ < 1 — e 


P 


I(e|p,W) 


= sup < a 


limE p « W" 1 - log 


1 


W3, 


iv) < 


< 1 - e 
























1 

n+2 ’ 


Theorem 4: Assume that \X n \ = d n , then the above quan¬ 
tities satisfy the following relations. 


□ then the conditions Q and Q are satisfied and 7 = 
Thus, there exists an identification code <t> n such that 


sup J(e|p, W) < D{ 1 - e, 0|W) < (e|W) 

P 

< SU p/(i|p,W) (24) 

P * 

sup7(e|p, W) < D\ 1 - e,0|W) < 5(e|W) 

P 

<supl(^|p, W), (25) 

P Z 

for any real number 0 < e <1. However, the first inequalities 
in d and E3 hold for 0 < e < 1 , and the third ones hold 
for 0 < e < 2. In particular, we obtain 

sup/(0|p, W) = D( 1,0|W) = + (0|W) (26) 

P 

sup7(0|p, W) = (1,0|W) = S(0|W), (27) 

p 

which is desired in Han & Verdu[2] and HanP] 1 . 

This theorem indicates the existence of a code satisfying the 
following: The second error probability A is asymptotically 
independent for the behavior of the distribution of the random 
variable of likelihood and always goes to 0 , and only the 
second error probability // asymptotically depends on it. 
Remark 2: Steinberg[14] claims the inequalities 


sup/(e|p,W) >D(X 1 ,X 2 \W), 

P 

sup7(e|p, W) > £> t (Ai,A 2 |W) 

p 

for Ai+A 2 < 1—e. If they are proved, by combining the above 
inequalities and Theorem|4] we can prove the equalities of the 
above inequalities in the continuous case. However, it seems 
that his paper has a gap in counting the maximum number of 
different pairs of a partial response and an A/'-type measure at 
the proof of Lemma 2, which is essential for these inequalities. 
That is, he estimated the total number of positive functions on 
X xy with the form 

1 M ' 

f( x iU) = 1 (x',y')(x,y), 

2=1 (x',y')EF 


where F is an arbitrary subset of X x y. The total measure 
of /, i.e ., J2( x y) e xxy not nece ssarily less than 

1, while he indicated that it is less than 1. Hence, this total 
number cannot be bounded by \X\ M . 

Proof: In order to prove the first inequalities, we choose 
an arbitrary real number R < sup p J(e|p, W) and a sequence 


def 


of input probability distributions p such that R < R’ = 

,a = /3 = 


/(1 — /i|p,W). Substitute M = e nH ,C = 


0 nR' 


log 2+1 
n+2 ’ log n 


in Theorem 


theorem 6 in Han and Verdu[2] claims that S'(OIW) = sup p /(0|p, W) 
always holds for any channel W if the input alphabet is finite. However, the 
proof in [2] contains mistake in part, as is mentioned in section 6.3 in Han[3]. 
Therefore, it has been an open problem as to whether this inequality holds or 
not. 


I$J = 


e n + 2 

p l-\-nR 


Wf 


M*») < (i + -?v P nW? j-tog^O/) < R' 

A( ®“> s + ("+ 2 > 2 +f("+ *r*\ 

s lQ f 2 + 1 + (n + 2 f e - n ( R '- R) . 

logn 

Therefore, we obtain 


lim — log log !$„! = R, 
n 


1 


W? 


lim/r($„) < limE p « W™ t -log—^-(y) < R' \ < y 


n ° Wf 


(28) 


limA(T>„) = 0 , 


which implies that 0|W) > R'. Thus, we obtain the first 
inequality in for 0 < e < 1. In the case of e = 1, we need 
to replace < y by = 0 at ca. By replacing lim by lim at (EU, 
we can similarly prove 0[W) > sup p /(l — ^t|p, W). 

Next, we proceed to the second inequalities. Let R be an 
arbitrary real number such that R > D( 1 — e,0|W). Then, 
there exists a sequence {< 1 >„} of identification codes such that 

R = lim-log log |4>„|,lim/i(T>„) < 1 - e,limA(<f>„) = 0. 
n 

Therefore, we can choose an integer N large enough, such 
that 1 — n(Q n ) ~ A(<l> ra ) > 1 — limp(<E>„) > e. Moreover, we 
choose a strictly increasing sequence {a n } of integers such 
that ai > N and 1 — lim^(4> a „) > e{e anR ' ,W n ), where 
R' = +(e,W). 

Thus, Lemma 0 yields that ( d an ) eariR > |$ ari |, which 
implies that R' > R. We obtain the second inequalities 
in ED. We can prove the second inequalities in < 1251 by 
choosing a strictly increasing sequence {a n } of integers such 
that 1 -- A($ a „) > 1 - lim/r(T>„) > e. 

Finally, we prove the third inequalities by using another 
expression of sup p /(e|p, W): 


sup/(e|p, W) 

p 


= inf < a 


lim max E„n W x 

p n 


-log 

n 




(y) > a 



Let R and R' be arbitrary real numbers such that R > 
sup p /(e/2|p, W) and R > R' > sup p I(e|p, W), then the 
inequality o yields that 


e(e nR ,W n ) 

< 2 minE p .W;" ji log ^-(y) > f?'| + e ~^ R - R 'V 2 . 

Taking the limit lim , we obtain 


\jme(e nR ,W n ) < e, 


(29) 
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which implies 5^(2e,W) < R. Thus, we obtain the third 
inequality in for 0 < e < 1. In the case of e = 2, we 
need to replace < e by < 1 at (I29> . By replacing lim by lim 
in the above, we can prove the third one in C3- ■ 


B. Wire-tap channel 

Next, we focus on a general sequence (W B = 
{W B,n }, W B = {W E ’ n }) of wire-tap channels, and define 
the following two kinds of capacities by 


C d (W B W B ) 
d = sup ( lim - log |$„| 

{<t>n} l Tl 


limes (<!>„) = lim cfe (<!>„) = 0 


C/( W B ,W E ) 

= f sup { lim - log |4>„| 

{3>„} l n 

Lemma 4: The inequality 

C d ( W B , W B ) > J(l|p, W B ) - 7(0|p, W B ) (30) 

holds for any sequence of input distributions p = { p "}. 
Furthermore, if \Z n \ = d n . 


limes('l ) n) = lim ———— = 0 


Cj( W B ,W E ) > 7(l|p, W B ) - /(0|p, W B ). (31) 
This theorem is an information spectrum version of Wyner’s 
result [4], that will be mentioned in the next section. 

Proof: Let R' > 7(0|p,W B ), R < I(l|p,W B ) - 
R' and choose a real number a such that 0 < a < 

min{7(l|p, W B ) - (R + R'), R' -7(0|p, W B )}. Substituting 
M = e nB , L = e nR ', C = e n ( R '- a \C' = e n ^ R+R ’ +a \ we 
can show that the right hand side of CD goes to 0, and that 


S p „ W E,n te n(R'-a) 


■o, 


S' 


■ 0. 


Hence, the right hand side of o 
the relations 


guR' 

go to 0. Concerning CD, 


n 


W E,n e n{R'-a)) + S pW E,n )S n(H'-a) log \Z'‘ 


S' 

pfi E,-n , e n(R —a) 

^ e nR' 


■) 


— — 7 l(fip’ l ,W E ’ n ,e n <- R '- a '>) logd 

1 3' p n tW E, n >e n(B'-») 

ip pTiRJ 


hold. Therefore, we obtain 1301 and J3 1 1 . ■ 

Conversely, we obtain the following lemma. 

Lemma 5: Let Q = {Q n } be a sequence of channels from 
arbitrary set X n to the set X n and p = { p "} be a sequence 
of distributions on X n . Then, the inequalities 

C d { W B , W B ) <sup {J(l|p, W b Q) - 7(0|p, W b Q)} 
pQ 

(32) 

CV(W S ,W B ) <sup {/(l|p, W b Q) — 7(0|p, W b Q)} 

P.Q 


hold, where WQ = {W n Q n } denotes the sequence of 
channels from X n to y n \ 

( W n Q n h(y ) d ^ f E W^{y)Q n i {x) 

x£X n 

for a sequence of channels W = {W n } from X n to y n . 

Hence, applying Lemma 0] to the sequence of the channels 
w b q,w b q, we obtain the following theorem. 

Theorem 5: 

C d { W B ,W B ) =C/( W B ,W B ) 

= sup {/(l|p, W b Q) —7(0|p,W B Q)} . 
p,Q 

Proof of Lemma 0 Let { <l> n = 

(M„, {Q^,.. -,QM n }, {2?", • • • ,£>m„})} be a sequence 
of codes of wire-tap channel such that 

R = lim - log |$ n |, lime#(<I> n ) = 0, limds($ n ) = 0. 
n 

Hence, Verdu-Han’s result [19] yields that the transmission 
capacity of the sequence of channel W B Q is less than 
7(l|p, W S Q), which implies 

R<L( i|p,w b Q). 

Furthermore, the property lim<7 e(<I>„) = 0 implies that 
S'(0 |W b Q) = 0. Hence, we have 

7(0|p, W b Q) = 0. (34) 


Thus, we obtain 

R = L( i!p,w s q)-7(o| p ,w b Q), 

which implies 03. 

Next, we assume that a sequence of codes of wire-tap chan¬ 
nel {4> n = (M n , {Q?,..., Ql In }, {:D r,..., V n M J)} satisfies 
that 


R = lim — log | 
n 


Since the mutual information 

M n 


lime s (<E>„) = 0, lim ^^^ = 0. 


n 


(W E ’ n Q n )i 


lE ^n) g M E (W E„ Qn)iiy log J_ {W E, nQn)i 

can be regarded as KL-divergence, Lemma |6] yields that 

M„ „ ( _ 


(y) 




(W E ’ n Q n )i 




(y) > 


M, 

2=1 

< lE{<S>n) + \ Q 
na 

for any a > 0. Thus, we obtain < I34I> . Therefore, similarly to 
d32> . we obtain (I33> . ■ 

Lemma 6: Assume that p and q are two probability distri¬ 
butions on H. Then, we have 


1 


D(p\\q) + - > a ■ p log > a 




(35) 


Proof: We focus on the two probability distributions on 

d = {logout) < a}: 

, , def p(w) , , def q{ui) 

PoM = ... , , goM = 


p{H 0 } ’ 


g{H o} - 


(33) 
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Hence, 

D(pk) = E pH log S + E pH log ^ 

>ap{fig} + E PoH (log P !^°! + log 

^ V 9^o I 9o M 

=Qp{^o) + P-ffiol log P fn\ + D (Po ll9o) 

>ap{^o} + p{n 0 } log P j^°| > otp{%} + p{fi 0 } logp{H 0 }. 
<7l“o j 

Finally, the convexity of the map x i—> x log x guarantees that 
p{fl 0 } logp{S2 0 } > We obtain l35l . ■ 


VI. Exponents in stationary memoryless channel 
A. Channel resolvability 

Next, we proceed to the stationary memoryless channel of 
a given channel W as a special case. 

First, we treat channel resolvability. As was shown by Han 
& Verdu[2] and Han [3], the information spectrum quantities 
of discrete memoryless channel of W is calculated as 

sup/(e|p, W) = sup J(e|p, W) = max/(p; W) 

P P P 

for 1 > e > 0, where 

I(p; W)=E P D(W X \\W P ). 

Hence, Theorem [4] yields 

S(e|W) = N t (e|W) = max/(p; W), 


Theorem 6: Assume that the cardinality |J/| is finite, then 

—ip{s\W,p) + sR 


e e (R\W,p) > max 

l>s>0 


1 + s 


en{R\W,p)> max {— <j>(t\W,p) — tR} 

0>t> —1/2 


eJR\W) > max 

l>s>0 


—ip(s\W) + sR 
1 + s 


(36) 

(37) 

(38) 


eniRlW) > max < — max<j)(t\W,p) — tR > , (39) 

0>t>-l/2 [ p 


where tjj(s\W,p) d = logE P J2 y W x +s (y) w p s (y) and 

^{s\W) d = logmaxp^y (Ho¬ 

using Pinsker’s inequality D(p\\q) > |p — q|| 2 , we obtain two 
inequalities ^en{R\W,p) < e f (R\W.p) and | eu{R\W) < 
e f (R\W), which implies different lower bounds of exponents: 

e e (R\W,p) > ^ max^ {— <j>(t\W,p) — tR} (40) 


eJR\W) > - max < — max <j)(t\W,p) —tR>. (41) 
2 o>t>— 1/2 [ p J 

We can derive different lower bounds of eu(i?|W,p) and 
ei;>(R\ W) from the inequality dill . However, these bounds 
are smaller than the bound presented here. 

Remark 3: Arimoto’s strong converse exponent [15] of 
channel coding of transmission code equals 


max {— max d>{t\W.p) — tR 
o>t>-i [ p 

which is a bit greater than the RHS of G3 when R is 
sufficiently large. 

Remark 4: By using inequality <|9} and type method, 
Oohama [8] has obtained a lower bound of e e (R\W): 


which has been obtained by Han & Verdu[2], Furthermore, 
using Theorem [2] we can discuss these problems in more 
details by treating the following optimal exponents: 


ee(R\W,p) 

d = sup { lim — log e(T'„,, W pn ) 

{*„} l n 


lim — log | \D n | < R 
n 


cd{R\W,p) 

d = sup ( lim — log D(* n , W™ ) 

{*„} l n 


lim-log|T'„| < R 
n 


- max < — maxd)(f|W, p) — tR> , 

2 0>t> —1 [^ p j 

which is a bit better than GU when R is sufficiently large. It 
is interesting that his approach is in contrast to our approach 
to ( 14 1 li . which is based on d 1 01 not on @- 

Remark 5: It is difficult to treat the exponent of the sum of 
two error probabilities in identification code based on Theorem 
Q For this purpose, we need a modified version of Theorem 
□ 

The following lemma is a preparation of our proof of 
Theorem |6] 

Lemma 7: For any s > 0 and 0 > t > — 1, the equalities 


and 

e e (R\W) d = \jm — \oge(e nR ,W n ), 
n 

e D (R\W) d =Um — \ogD(e nR ,W n ), 
n 

where p n is the rc-fold identical independent distribution of 
p. As is discussed by Oohama [8], by using Lemma 0 the 
exponent e e (R, W) gives a lower bound of strong converse 
exponent of identification code. hold. 


E (e„(>w)) i+ 1 

; y n £y n 


^ E (e,<>w))A) 


i+t 




( max 

peP(x) 


E (e p w/ +t 

ii ' 




n 


(42) 


( 43 ) 
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Proof: Since EH has been shown by Arimoto [15], 
we prove only S3 by the same method. Since the function 
/ : p h-+ max p6 p(^) J2 y (E p W,/ +s (y)) s is continuous and 
convex function, if and only if f{p*) = max p f(p), there 
exists a constant A such that 




df / = A if p*(x) > 0 


j rey n 


= , E (E P HW‘))'+‘) 


1 — S 


y n & yn 


It implies the equation i42\ . ■ 

Proof of Theorem |6} By inequality dlOl of Theorem El 
we have 


D(e nH ,W. p r>) < 


< 


log(l + ( e ™fl)t e ^(t|M" n ,P n )) 
—t 

(e nR ) t e^^ wn 
—t 

e n(<j>(t\W,p)+tR) 

~t ’ 


(44) 


e D (R\W,p) > —(/)(t\W,p) — tR 


(45) 


for 0 > t > —1/2. Since (f>{t\W,p) + tR is continuous for t, 
the inequality d37l> holds. By inequality dlOl and Lemma |7| 
we have 


D(e nU , W n ) < max 


l0g(l + ( e nii)t e 0(t|W n ,p)) 


< 


p<ET(X") -t 

(e nR Y ■ max pGV{xn) 


~t 


dp(x) \ < A if p*(x) = 0 

Indeed, A is calculated as 

J2p( x ) x = ^2p( x ) E w x +s (y) (j2p*i x ) w x +s {y)\ 

x x y \ x / 

Thus, if and only if f(p*) = max p f(p), 

E^ 1 + 5 (y) 

y \ x / 

[ = (E X p*( x ) w x +a (y)) 1 ~ s if p*(x) >o 

1 < (E x P*{x)Wl +s {y)) 1 ~ b if P*(x) = 0 

p* gives the maximum. Hence, if p* satisfies the above 
condition, ( p*) n also satisfies the following condition: 

E(^) 1+S (y") ( E(pT (*”)) 1+8 (?/")) 

{ = (Ex4p*)"(^ n )(^" ) 1+s (y n )) x ] s if (p*) n (x n ) > 0 
1 < {E x n(pT( xn )(W? n ) 1+s (y n )y s if {p*) n (x n ) = 0, 

which is a necessary and sufficient condition for 

E (E(p*)n (W" (t / n )) 1 + S ) 1_S 


(e nR y • maxpg-p( ^) e 
-t 


n0(t| W,p) 


(46) 


for 0 > t > —1/2. Hence, in a manner similar to the derivation 
of (1371 from (1441 . we obtain (El from ESJ. 

Next, we derive EH and EH- To this end, we first derive 
an upper bound of 


^p,W,e R ' + 


16' w s' 
p , W } e H 


For any 1 > s > 0, we choose R' = f gy us j n g 

Markov inequality, we can evaluate 6 p We R' and &' p We R' as 

r e- R 'W x (y)Y 


W P (y) 


X p,W,e R ' < E P E W x{y) 

y^{wf(y) >eR '} 

<E p E w x (y) f e~ sR ' = e^( s l w*)-*# 


\W p (y) 


ip(.s\W, p)-aR 

—e i+» 


and 


W s e p E ^ ‘ y) 


(47) 


1 — S 


W p (y) \e~ R 'W x {y ) 

ye{ w®-(y)< ejR } 

EM Y *>(!-+>«' = e V>(«|w,p)+(i-*)Ji' 


respectively. Inequality EH yields 


(48) 


W,p) + (l-s)R'-B «Ms|W,p)-sR 

p ’ ’ < e-^-= e i+s . (49) 


o-R 


Combining EH and J49l , we have 


_ _ ^■ P W*>R' if>(s\W,p) — sR 

Mp,w, e « + y < 3e i+- • (50) 

Hence, EH and @ in Theorem El guarantee that 


^(. B \w n , P n )-snR ..»(.|ir,p)-«a 


e(e nH , W£) < 3e - m - = 3e n -r+5- (51) 

for 1 > s > 0 because ip( s \W n ,p n ) = n'tj)(s\W.p). Thus, 
E3 implies that 


for 0 > t > —1/2, where the second inequality follows from 
log(l + x) < x. From EU> we obtain 


e e (R\W,p) > 


—ip(s\W,p) + sR 
1 + s 


(52) 


for 1 > s > 0. Taking the maximum for 1 > s > 0, we obtain 

EH. 
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We proceed to the proof of By inequalities @ and 
( 1501 . we obtain 


ip(s\W' n -,p)-snR 


e(e nH ,W n )< max 3e-~ 


— snR 

= 3e !+» 


per(x«) 

V>(s | W n ,p) 

max e r + 5 
penx") 


Now, we estimate an upper bound of 

e*Ww",p) = Ep £ Wl +s {y)W~ s {y). 


(53) 


(54) 


Since the map x i—> ar +s is convex, we have 

W p (y) = E p W x (y) >E P W' +S (y), 
which imply that 

W~ s {y)< (E p (W x (y)) 1+s )~ s . 

Hence, the relations 

E p J2 w * +s (y) W p s (y) = Y, E P w * +s (y) W P S M 


<E( E P Wl +s (y)) 


1 —s 


hold. Using and Lemma Q we can evaluate 

ma x E p J2 W:(y) 1+s W^y)- s 


p£V(X n ) 


ye yn 


' y&y n 


(55) 


= ^ max^ (EpW-i+^y)) 1 = e »W(«l"0). (56) 

Combining ED and (EH, we have 


Theorem 7: Assume that the cardinality \Z\ is finite, then 
there exists a sequence {<1' „} of codes for any real numbers 
R, R' and any probability distribution p such that 


lim — log |$„| = R 
n 

lim—- loge B (<E> n ) > max \—d)(s\W B ,p) — s(R + R 1 )} 
- n i>s>o 1 J 

(58) 

lim—- log/ B ($„) > max {— f(t\W E ,p) — tR'} (59) 
n 0 >i>— 1/2 L ’ 


lim — log^E^n) > max 
n i>s>o 


— i/>(s\W E ,p) + sR' 
1 + s 


(60) 


lim —logd_E($ n ) > i max {-(/)(t\W E ,p) - tR'} . 
n 2 o>t>— 1/2 


(61) 


Indeed, these exponents are very useful for evaluating error 
and wire-tapper’s information for a finite n. 

Proof: The inequality El immediately follows from 
EH. By using an evaluation similar to El. we can show 
El from ED- Furthermore, by using an evaluation similar to 
El- we can show ( I60> from El- " ■ 


VII. Comparison of lower bounds of exponents 


Finally, we compare the lower bounds El. El- El- and 
(14 1 1 of error exponents of channel resolvability. 

Theorem 8: Assume that A '=' R — I(p; W) is sufficiently 
small. Then, RHSs of El and El (which are lower bounds 
of exponent of the variational distance) are approximately 
calculated as 


RHS of El 
RHS of El 


f -f>{s\W,p) + sR\ ^ A 2 

max < -> = — - -- 

i>s>o [ 1 + s J 4 J(p\W) 


- max {— cj)(t\W,p) — tR} = 

2 0>i> —1/2 1 


A 2 

8 J(p;W)' 


where 


■0(s| W)-sR 


e ( e nK r) < e"-1+7 


(57) 


for any 1 > s > 0. In a manner similar to the derivation of 
(I36> from (15II . we can derive ( 1381 from < 1571 . ■ 


B. Wire-tap channel 

Next, we proceed to discrete memoryless wire-tap channel. 
Applying Theorem [4] to this case with the input identical and 
independent distribution, we obtain 


C( W B ,W f 


1 


lim — log log |<f+ 


def 

= sup 
{<*>-} 

> sup { I(p; W B )-I{p ; W E )} , 

V 


lime B ($„) = limd B (T> n ) = 0 


which has been obtained by Wyner [4]. Hence, Theorem El 
can be regarded as a general extension of Wyner’s result. 
Moreover, using Lemma [3] we derived several explicit lower 
bounds of exponents. 


J(p; w) 

A = \ {^p,xE Wx ,y^ogW x {y) - log W p (y)) 2 - / 2 (p; W)) . 

Moreover, RHSs of El and (which are lower bounds of 
exponent of the worst variational distance) are approximately 
calculated as 


RHS of El max 

s>0 


— i/j(s\W) + sR 
1 + s 
A 2 


4(+(po; W 7 ) + E po i?(W x )) 


l 


RHS of (14111 - max < — max d>(t\W, p) — tR 
2o>t>-i/2* - Vl ’ 

A 2 


8J(p 0 ; W) ’ 


where po = argma x p I(p;W). 

Thus, when R is sufficiently close to max,, I (p: W ), J36I gives 
a better lower bound than (1401 . Of course, this comparison can 
be applied to exponents of eavesdropper’s information in wire¬ 
tap channel, i.e., the comparison of RHSs of EH and ED- 
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On the other hand, {U gives a better lower bound than ED , 
if and only if 

E P0 H(W X ) 

< i {Ep^w^y^og W x {y) - log Wp(y )) 2 - I 2 (p;W)) . 

Therefore, although R — rna x p I(p;W) is small enough, the 
relation between bounds G3 and ED is not clear. 

Proof: By using a Taylor expansion, we obtain the 
approximations: 

ip(s\W,p) = I(p ; W)s + J(p; W)s 2 
<i>(t\W,p) = - I(p ; W)t + J(p\ W)t 2 
^{s\W) S* I(p 0 ; W)s + (J(p 0 ; IE) + E Po H(W x ))s 2 , 


[5], Winter et. al.[ 6] and Cai & Yeung[17], and identification 
codes has been discussed by Ahlswede & Winter) 18]. Hence, 
several quantum extensions of the results presented here can 
be expected. Some has been obtained by the author. And some 
of them have appeared in the author’s textbook) 13]. Those not 
already presented will appear in a forthcoming paper. 
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Thus, 


max 

l>s>0 


= max 

l>s>0 


—i/j(s\W,p) + sR 
1 + s 

—I(p', W)s — J(p; W)s 2 + ( I{p ; W) + A )s 


1 + s 


A 2 


= max {-J(p-W)s 2 + As} = 
i>s>o 1 J 4J(p; W) 

max {— (j)(t\W,p) — tR} 

0>t> —1/2 

= max {l(p-,W)t — J(p;W)t 2 — (I(p-,W) + A)t\ 

0>t> —1/2 

A 2 

= max {-J(p;W)t 2 - Af} = —-—— 
o>t>-i /2 1 J 4:J(p;W) 


max 

s>0 


= max< 

s>0 


-l/)(8\W)+sR 
1 + s 

-I(Po ; W)a - (J(p 0 ; W) + F, po H(W x ))s 
1 + s 


s(I(po; W) + A) j 

1 + 8 J 

-max { —(J(p 0 ; W) + E Po H(W x ))s 2 + As} 

s>0 

A 2 


'4(J(p 0] W)+E Po H(W x ))' 


VIII. Conclusion 

We give several non-asymptotic formulas in identification 
code, channel resolvability, and wire-tap channel. Using these 
formulas, we give the achievable rate channel resolvability for 
the general channel, which had been an open problem. Also, 
we derived several asymptotic relations among divergence 
rates, capacities of identification code, and e capacities of 
channel resolvability. 

From these non-asymptotic formulas, we obtained lower 
bounds of error exponents of channel resolvability in the 
stationary memoryless setting. Moreover, we derived lower 
bounds of error probability and wire-tapper’s information in 
the stationary memoryless setting in wire-tap channel. 

Concerning the quantum setting, wire-tap channel has been 
discussed in the discrete memoryless channel case by Devetak 
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